4,314 research outputs found
A model-based multithreshold method for subgroup identification
Thresholding variable plays a crucial role in subgroup identification for personalizedmedicine. Most existing partitioning methods split the sample basedon one predictor variable. In this paper, we consider setting the splitting rulefrom a combination of multivariate predictors, such as the latent factors, principlecomponents, and weighted sum of predictors. Such a subgrouping methodmay lead to more meaningful partitioning of the population than using a singlevariable. In addition, our method is based on a change point regression modeland thus yields straight forward model-based prediction results. After choosinga particular thresholding variable form, we apply a two-stage multiple changepoint detection method to determine the subgroups and estimate the regressionparameters. We show that our approach can produce two or more subgroupsfrom the multiple change points and identify the true grouping with high probability.In addition, our estimation results enjoy oracle properties. We design asimulation study to compare performances of our proposed and existing methodsand apply them to analyze data sets from a Scleroderma trial and a breastcancer study
A Global SU(5) F-theory model with Wilson line breaking
We engineer compact SU(5) Grand Unified Theories in F-theory in which
GUT-breaking is achieved by a discrete Wilson line. Because the internal gauge
field is flat, these models avoid the high scale threshold corrections
associated with hypercharge flux. Along the way, we exemplify the
`local-to-global' approach in F-theory model building and demonstrate how the
Tate divisor formalism can be used to address several challenges of extending
local models to global ones. These include in particular the construction of
G-fluxes that extend non-inherited bundles and the engineering of U(1)
symmetries. We go beyond chirality computations and determine the precise
(charged) massless spectrum, finding exactly three families of quarks and
leptons but excessive doublet and/or triplet pairs in the Higgs sector
(depending on the example) and vector-like exotics descending from the adjoint
of SU(5)_{GUT}. Understanding why vector-like pairs persist in the Higgs sector
without an obvious symmetry to protect them may shed light on new solutions to
the mu problem in F-theory GUTs.Comment: 95 pages (71 pages + 1 Appendix); v2 references added, minor
correction
Frequency Domain Estimation of Continuous Time Cointegrated Models with Mixed Frequency and Mixed Sample Data
Recent work by the author on mixed frequency data analysis has focused on the estimation of cointegrated systems in continuous time based on a fully specified dynamic system of equations, while the estimation of cointegrating vectors in a discrete time system has been approached using a semiparametric frequency domain estimator. We extend the latter approach to cover the continuous time case, establishing the asymptotic properties of the frequency domain estimator and explore, in a simulation study, the effects of misspecifying the continuous time dynamic model in discrete time compared to treating the dynamics nonâparametrically. An empirical illustration is also provided
Heat transport in insulators from ab initio Green-Kubo theory
The Green-Kubo theory of thermal transport has long be considered
incompatible with modern simulation methods based on electronic-structure
theory, because it is based on such concepts as energy density and current,
which are ill-defined at the quantum-mechanical level. Besides, experience with
classical simulations indicates that the estimate of heat-transport
coefficients requires analysing molecular trajectories that are more than one
order of magnitude longer than deemed feasible using ab initio molecular
dynamics. In this paper we report on recent theoretical advances that are
allowing one to overcome these two obstacles. First, a general gauge invariance
principle has been established, stating that thermal conductivity is
insensitive to many details of the microscopic expression for the energy
density and current from which it is derived, thus permitting to establish a
rigorous expression for the energy flux from Density-Functional Theory, from
which the conductivity can be computed in practice. Second, a novel data
analysis method based on the statistical theory of time series has been
proposed, which allows one to considerably reduce the simulation time required
to achieve a target accuracy on the computed conductivity. These concepts are
illustrated in detail, starting from a pedagogical introduction to the
Green-Kubo theory of linear response and transport, and demonstrated with a few
applications done with both classical and quantum-mechanical simulation
methods.Comment: 36 pages, 14 figure
Goodness-of-Fit Tests for Symmetric Stable Distributions -- Empirical Characteristic Function Approach
We consider goodness-of-fit tests of symmetric stable distributions based on
weighted integrals of the squared distance between the empirical characteristic
function of the standardized data and the characteristic function of the
standard symmetric stable distribution with the characteristic exponent
estimated from the data. We treat as an unknown parameter,
but for theoretical simplicity we also consider the case that is
fixed. For estimation of parameters and the standardization of data we use
maximum likelihood estimator (MLE) and an equivariant integrated squared error
estimator (EISE) which minimizes the weighted integral. We derive the
asymptotic covariance function of the characteristic function process with
parameters estimated by MLE and EISE. For the case of MLE, the eigenvalues of
the covariance function are numerically evaluated and asymptotic distribution
of the test statistic is obtained using complex integration. Simulation studies
show that the asymptotic distribution of the test statistics is very accurate.
We also present a formula of the asymptotic covariance function of the
characteristic function process with parameters estimated by an efficient
estimator for general distributions
Wavefunctions and the Point of E8 in F-theory
In F-theory GUTs interactions between fields are typically localised at
points of enhanced symmetry in the internal dimensions implying that the
coefficient of the associated operator can be studied using a local
wavefunctions overlap calculation. Some F-theory SU(5) GUT theories may exhibit
a maximum symmetry enhancement at a point to E8, and in this case all the
operators of the theory can be associated to the same point. We take initial
steps towards the study of operators in such theories. We calculate
wavefunctions and their overlaps around a general point of enhancement and
establish constraints on the local form of the fluxes. We then apply the
general results to a simple model at a point of E8 enhancement and calculate
some example operators such as Yukawa couplings and dimension-five couplings
that can lead to proton decay.Comment: 46 page
Adaptive Evolutionary Clustering
In many practical applications of clustering, the objects to be clustered
evolve over time, and a clustering result is desired at each time step. In such
applications, evolutionary clustering typically outperforms traditional static
clustering by producing clustering results that reflect long-term trends while
being robust to short-term variations. Several evolutionary clustering
algorithms have recently been proposed, often by adding a temporal smoothness
penalty to the cost function of a static clustering method. In this paper, we
introduce a different approach to evolutionary clustering by accurately
tracking the time-varying proximities between objects followed by static
clustering. We present an evolutionary clustering framework that adaptively
estimates the optimal smoothing parameter using shrinkage estimation, a
statistical approach that improves a naive estimate using additional
information. The proposed framework can be used to extend a variety of static
clustering algorithms, including hierarchical, k-means, and spectral
clustering, into evolutionary clustering algorithms. Experiments on synthetic
and real data sets indicate that the proposed framework outperforms static
clustering and existing evolutionary clustering algorithms in many scenarios.Comment: To appear in Data Mining and Knowledge Discovery, MATLAB toolbox
available at http://tbayes.eecs.umich.edu/xukevin/affec
Structure in 6D and 4D N=1 supergravity theories from F-theory
We explore some aspects of 4D supergravity theories and F-theory vacua that
are parallel to structures in the space of 6D theories. The spectrum and
topological terms in 4D supergravity theories correspond to topological data of
F-theory geometry, just as in six dimensions. In particular, topological
axion-curvature squared couplings appear in 4D theories; these couplings are
characterized by vectors in the dual to the lattice of axion shift symmetries
associated with string charges. These terms are analogous to the Green-Schwarz
terms of 6D supergravity theories, though in 4D the terms are not generally
linked with anomalies. We outline the correspondence between F-theory topology
and data of the corresponding 4D supergravity theories. The correspondence of
geometry with structure in the low-energy action illuminates topological
aspects of heterotic-F-theory duality in 4D as well as in 6D. The existence of
an F-theory realization also places geometrical constraints on the 4D
supergravity theory in the large-volume limit.Comment: 63 page
Probabilistic Clustering of Time-Evolving Distance Data
We present a novel probabilistic clustering model for objects that are
represented via pairwise distances and observed at different time points. The
proposed method utilizes the information given by adjacent time points to find
the underlying cluster structure and obtain a smooth cluster evolution. This
approach allows the number of objects and clusters to differ at every time
point, and no identification on the identities of the objects is needed.
Further, the model does not require the number of clusters being specified in
advance -- they are instead determined automatically using a Dirichlet process
prior. We validate our model on synthetic data showing that the proposed method
is more accurate than state-of-the-art clustering methods. Finally, we use our
dynamic clustering model to analyze and illustrate the evolution of brain
cancer patients over time
- âŠ